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DATA COMPRESSION PRESERVING STATISTICAL INDEPENDENCE 
Georg E. Morduch and William M. Rice 


ABSTRACT 

The purpose of this study was to determine the optimum points of evaluation 
of data compressed by means of polynomial smoothing. It is shown that a set 
y of m statistically independent observations Y(tj), Y(t 2 ), . . . Y(t m ) of a 
quantity X(t), which can be described by a (n - l)th degree polynomial in time, 
may be represented by a set Z of n statistically independent "compressed 
observations" Z(r j ), Z{j 2 ), . . . Z(r n ), such that the "compressed" set Z has 
the same information content as the observed set Y. The times Tj , t 2 , . , . r n 
are the zeroes of an nth degree polynomial P n , to whose definition and properties 
the bulk of this report is devoted. The polynomials P n are defined as functions 
of the observation times tj, t 2 , . . . t m , and it is interesting to note that if the 
observation times are continuously distributed the polynomials P n degenerate 
to Legendre polynomials . 

The proposed data compression scheme is a little more complex than those 
usually employed, but has the advantage of preserving all the information content 
of the original observations . 
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PREFACE 

It had been our original intention to publish as a separate report the sections 
in "The Application of Data Compression to Orbit Determination, Old Dominion 
Systems Inc., January 15, 1973, Contract NAS5-11902" dealing specifically with 
data compression. However, after some discussions we realized that the data 
compression that was described in the above mentioned report did not neces- 
sarily yield a set of statistically independent "compressed" measurements. 
Although this in itself does not invalidate the method, there are obvious 
advantages if the compressed points are statistically independent. In developing 
the data compression technique described in this report we therefore made it a 
basic requirement that the "compressed" points should be statistically independent. 
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DATA COMPRESSION PRESERVING STATISTICAL INDEPENDENCE 

1.0 Introduction and Statement of the Problem 

1.1 Introduction 

Data compression and data smoothing is used where large volumes of data 
must be processed. Furthermore, it is necessary to weight the data properly 
in order to reflect the confidence in each measurement. Tracking data that is 
to be used for orbit determination and analysis is an example of where compres- 
sion and smoothing is required in order to achieve cost effectiveness and at the 
same time obtain "correct" results. 

The "correctness" of results is highly dependent upon the compression and 
smoothing technique employed. Careful application of smoothing and compression 
which will yield a single point representation of time varying data may be very 
accurate but it may also be very destructive of information content. 

All tracking data, to continue with our example, undergoes some transforma- 
tion in programs known as pre-processors, where, in general, calibrations are 
applied and smoothing and compression is performed. Operational preprocessors 
make use of least squares polynomial smoothing. Data compression is accomplished 
by evaluating this polynomial at one or more points. The uncertainties or weights 
associated with these smoothed data points are seldom obtained, yet they are 
required in order to properly use the data. Furthermore, one must be careful 
where the least squares polynomial is evaluated lest serious error be introduced 
(i.e. the end points are notoriously poor fitting). 

Although, as already mentioned, the main application of data compression 
is in the area of data preprocessing, the technique may also be applied within 
the main processing program. In this case the data being compressed is not the 
observational data, but the observational discrepancies, i.e. the differences 
between observed and predicted measurement values. (See "The Application 
of Data Compression to Orbit Determination" with particular emphasis on the 
processing of Minitrack observations. Contract NAS5-11902, Old Dominion 
Systems Inc., January 15, 1973.) 

In this report we shall describe a data compression technique, in which the 
compressed data points carry the same amount of information as the original 
data points. In order that this technique may be applied the following two criteria 
must be met: 
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1. The "noise” at each data point must be random gaussian and the "noise" 
at any two points must be uncorrelated. (Although the second criterion 
rarely applied for real data it is almost invariable assumed to hold true 
in practical orbit determination.) 

2. The "noise free" data must fit some polynomial of degree v - 1, say, 
in time. 

In order that no information be lost the number of compressed data points 
must be v. Statistical independence will, however, still be preserved if the 
number of compressed points is less than v . (In order not to introduce any 
errors into the compressed points it is, of course, necessary that the fitted 
polynomial be of degree v - 1 as before.) 

In essence, the technique developed will yield a set of compressed data 
points, which also satisfy the above two criteria, with the addional obvious 
requirement that the polynomial of the second criterion be the same as for 
the original set of points. 

In Section 1.2 below we define the problem in mathematical terms. In 
Section 2 we obtain the solution of the problem in terms of a set of orthogonal 
polynomials P , which are defined in Section 3. In this section we also derive 
all properties of the polynomials required in Section 2. In Section 4 we show 
that "underfit" (i.e. the effect of assuming that the polynomial of criterion 2 
above, is one degree lower than it really is) by itself does not introduce any 
error in the compressed data points. Finally, in Section 5 we establish that the 
zeroes of any two of our polynomials, whose degree differ by one, are interleaved. 
This result adds to our understanding of the behavior of these polynomials. It 
is also an interesting general result since for example Legendre Polynomials 
are but particular cases of these polynomials. 

1 .2 Statement of the Problem 

Given a set of data represented by the vector Y, can we find another vector Z, 
which will give an equally good representation of the data. 

It is assumed that Y is an N dimensional vector of the form 

Y T = (y 0 - y r •••■ Vi) (1,1) 

where each y. can be represented as a polynomial in t i of degree v - 1 plus 
some error or noise e i8 
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Thus 


where 


v- 1 

y. = / b + e. for 
1 / ■ ** 1 

fj.- o 


i = 0, 1, • • • N - 1 


and where 


< t t < t 2 


< . 


< t 


N“ 1 


E(e.) = 0 
E(e. e. ) = 0 if 
E(e. e.) = wT 1 , 



where E denotes "the expected value of". 
It is further assumed that 


j = 0, 1, ... N - 1 


and that 


w. >o for i = 0, 1, . . . N - 1 


v <, N 

Equation (1.2) may equivalently be written in the form, 

V - 1 

+ v for i = 0 « 1 >--- N - 

/x= 0 

where (t) is some polynomial in t of degree ^ . 

Equation (1.9) may be expressed in matrix form as, 

Y = Fa + e, 

where 

aT = (V a i a v-i> 


( 1 . 2 ) 

(1.3) 

(1.4) 

(1.5) 

( 1 . 6 ) 

(1.7) 

( 1 . 8 ) 

(1.9) 

( 1 . 10 ) 
(l.H) 
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and 



,T = (%• <v 

••• ’ e N-l) 


(1.12) 

W 

p i<'o> 

P 2 (*«> 

P y- 1^0^ 


p o(ti) 

p ,(',> 

P 2 ( t 1 ) 

K- ,(t,) 

(1.13) 

_ P 0 ( Vl) 

P I<V.) 

P 2 ( ^N~ 1 ) ‘ ’ ' 

P !^- 1 ( t N“ 1 ) 



Equation (1.4) may be written as 


and equations (1.5) and (1.6) as, 


where W is the diagonal matrix, 


E(e) = 0 


E(ee T ) = W~ 1 . 


W = 


w o 0 0 


0 Wj 0 


0 0 w„ 


0 0 0 


0 

0 


N- 1 


(1.14) 


(1.15) 


(1.16) 


The problem may now be stated as: Can we find a vector Z such that 

Z = Ga + e , (1.17) 


where 


Z T = (z 0> Zj, ... z y _ l)t 
eT = ^ e 0’ e V • • • e v - l ) • 


(1.18) 

(1.19) 
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and 



P o( T o) 

P i( r o) 

p 2 ( r 0 ) • 


G = 

P o( r i) 

P l^l> 

p^l) • 

■ ■ P v-l( T 0> 


rH 

1 

• • w 

D^| 

P l(Vl) 

p 2 0*-i) ■ 

' ' P v-l(Vl> 


and where 

E(e) = 0 
E(e e T ) = V" 1 

for some diagonal matrix. 



( 1 . 20 ) 


( 1 . 21 ) 

( 1 . 22 ) 


(1.23) 



2.0 Solution of the Problem 

In order to estimate the vector we proceed in the usual manner by obtaining 
a weighted least squares solution, i.e. we minimize the quantity 

(Y- Fa) T W(Y - Fa), 

where a denotes our estimate of a. 

Letting the partial derivative of the above quantity with respect to a vanish 
we deduce that 

F T WY= (F T WF)a. 


5 



It follows from the above and equation (1.10) that 


a = a +(F t WF)' 1 F T We 


( 2 . 1 ) 


Hence if we write, 


Z = Ga, 

it follows from the above and equation (1.17) that 

e =G(F t WF) _ 1 F T We (2.2) 


Since E(e) = 0 it follows that 


E(e) = 0, 


(2.3) 


i.e. equation (1.21) is satisfied. 

From equations (2.2) and (1.15) we deduce that 

E(ee T ) =G(F t WF) _ 1 F T WE(ee T )WF(F T WF)- 1 G T 

= G(F t WF) _1 F t WW" 1 WF(F t WF)- 1 G t 
= GCFTWF)- 1 G t (2.4) 

It will be shown in the next section that there exists a set of polynomials I> (t), 

?! (t). . . • Pj (t) with the property that 

N~ 1 

y P; (t K ) Pj(t K ) \ = S. . , for i, j - o, 1, ... N - 1 ( 2 ’ 5 ) 

k= o 

where S^is the Kronecker delta satisfying S. = 0 if i 4 j, and S. . = 1 if i = j. 
These polynomials further have the property that their roots are ail real and 
distinct. If the roots of V v (t) = 0 are arranged in ascending order and are denoted 
by Tj , . . . j, then 


t < T n < r < , 


< t < t 
v-l N- 


Equation (2.5) may be expressed in matrix form as 


( 2 . 6 ) 


F t WF= I, 


(2.7) 



where I is the identity matrix. The polynomials also have the property that 

GG t = V _1 , ( 2 * 8 ) 

where V is a diagonal matrix. G being defined by equation (1.20) with t q , 
t -r _ i being the roots of P^ (t) = 0. 

It thus follows from equations (2.4), (2.7) and (2.8) that 

E(ee T ) = V 1 , (2.9) 


i.e. equation (1.22) is satisfied. 

In the next section we shall demonstrate that the polynomials Pj (t) with the 
desired properties do indeed exist. 


3.0 Definition of Orthogonal Polynomials P n (t) 


3.1 Definition of Polynomials P n (t) 

Let us define the polynomial P n (t)* of degree n in t by the determinant 


H 0 Hl H 2 . . H 

H x H 2 H 3 . . . H 





where 


H 


H_ H 


n-l “n n + 1 ' ' “2n~l 


H„ 


t 


t n 


N~ 1 




w T , 


K= 0 


(3.1) 


(3.2) 


- 1 — _ 

* It is important to note that the polynomial defined is P n (t) and not P n (t). The relationship 
between these two polynomials is given in section 3.2.2 equation 3.24- 
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If we denote the cofactor of t 1 in the determinant for P n (t) by C n t then 
equation (3.1) may be written in the form 


p (t) = y -1 C . t 1 (3.4) 

i=0 

3.2 Some Properties of the Polynomials P n (t) 

3.2.1 Proof that C >0 

n , n 

We shall first establish that for 0 < n < N, 


C > 

n , n 


By definition, C n n is given by the determinant 


Ho H, 


H 


n~ 1 


H, 


H 2 . . . H„ 


(3.5) 


Let us define the n x n matrix A, as the matrix whose components ] are 
given by 


N~ 1 

A ii = £} Wi < 3 - 6 > 

K = 0 

A comparison of definitions (3.2) and (3.6) shows that 


H. , . = A . 

i + j i ] 


It hence follows from equation (3.5) that 


(3.7) 


C = | A | (3 * 8 ) 

n , n 1 1 

To show that | A | > 0 it is sufficient to show that the symmetric matrix A is 
positive definite, i,e. that for any real non-zero vector x, 


x T Ax > 0 
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If we define the (n x N) matrix E, as the matrix whose components E. K are 
given by 


E 


iK 




then by definition 


A = EWE t 

We thus have to demonstrate that for any non-zero x 

x t EWE t x > 0, 


(3.9) 


(3.10) 


i.e. that (x T E) W (x T E) T > 0. 

Since W is a diagonal matrix whose diagonal elements are greater than zero 
the above is certainly true provided that x T E 4 0 for any real non-zero vector x. 

In other words, we have to show that the r rows of the n x N matrix E are 
linearly independent, i.e. we have to show that following vectors are linearly 
independent, 


( 1 , 1 , • • • 1 ) 
(V t i> ■ • • t N - J ) 
( t o < t i’ ' ‘ • ^-i) 




n~ 1 


h-n- 1 


t n-l 

N-l 


) 


Since n ^ N it is clearly sufficient to show that the vectors 


( 1 , 1 , ... 1 ) 

(to, tj, ... t n _ 1 ) 


(t n_1 t n_1 
( 0 > L 1 ■ 


t n ' 

n- 


1 

1 


) 


are linearly independent. 
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But the determinant of the above vectors is given by (see Appendix A) 


(*, - VCt, - V CVl - V - fc l) ■ ' • (Vl " ' (Vl ~ Vj) 

Since by assumption all the t’s are different none of the terms appearing 
in the product can vanish. Therefore the determinant cannot vanish and thus 
the rows of the matrix E are linearly independent from which it follows that 


>0 for n = 0, 1, . . . N 


(3.11) 


3.2.2 Orthogonal Properties and Definition of (t) 
Let d(l^) denote the vector, 


^(Sc) t K’ t K’ ’ - 


t") for K= 0. 1,... N - 1 


and f , the vector 


f s = (H s . H s+1 , ... H s+n ) for S = 0 , 1 ,... n - 1 
We deduce from equations (3.2), (3.12) and (3.13) that 


(3.12) 


(3.13) 


f 

S 


N- 1 


t K W K d < t K> f ° r 


K = 0 


It follows from equations (3.1) and (3.13) that 


s = 0, 1, . . . n - 1 


(3.14) 


W = 


r 


n- 1 


d(t) 


Since each of the first n rows of the above determinant is a linear combina- 
tion (given by equation (3.14)) of the N vectors d(t Q ), d(t j ) , . . . d(t N j ) it follows 
that they cannot be linearly independent if n exceeds N, i.e. 
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P (t) = 0 for n>N 


(3.16) 


Similarly 


P N ( tK ) = 0 for K = 0, 1, •• • N 
Equations (3.4) and (3.17) imply that 

P N ( t ) = C n > n( t - t 0)( t - t l) --Ct-t N _ 1 ) 

It follows from equations (3.14) and (3.15) that 


Hence 


Also since 


N“ 1 

L 

K = 0 




n- 1 


N" 1 

E 

K = 0 


WnCV = 0 if s <n 


= C 


n+ 1 , n + 1 


f 

n 


since it is the cofactor of t n+1 in the expansion of P n+1 (t) it follows from 
equation (3.19) that 


N- 1 


E 

V = A 


t n w P ft 1 = C 
K K n' K' ■ 


n + 1 , n + 1 


(3.17) 

(3.18) 

(3.19) 

(3.20) 


(3.21) 
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It follows from equation (3.4) and (3.20) that 


p s ( t K ) = 0 if s f n ( 3 - 22 ) 

K = 0 

[The inequality sign follows since by symmetry equation (3.22) must hold both 
for s < n and n < s] . 

From equation (3.4), (3.20) and (3.21) we deduce that 


N" 1 


L 

v = n 


P n(V VW 


c c 

ri , n ri + 1 , n + 1 


(3.23) 


Since by equation (3.11) C n n and C n+ are positive, it follows from the 
above that we may define a polynomial P n (t) by 


PnCt) 


PnCtV^C 


n , n ^n+ 1 , 


n+ 1 


for n = 0, 1, • N - 1 


(3.24) 


We may now deduce from equations (3.22), (3.23) and (3.24) that 


N- 1 

2^ P i (t K)Pj( t K> W K = S ij- for j = 0, 1 N - 1 (3.25) 

K= 0 

[We have thus shown that the polynomials Pi (t) satisfy equation (2.5) — which is 
the same as (3.25)] . 


3.2.3 Proof That the Zeroes of P n (t) are Real and Distinct 

We shall next show that the zeroes of a polynomial P n (t) are real, distinct 
and lie in the range t < t < t N . The proof given here follows that given by 
Hildebrand [1] for orthogonal polynomials. Since any polynomial g i ( t ) of degree 
i may be written as a linear combination of polynomials P 0 , P ls ... P. it follows 
from equation (3.25) that 

k- 1 

J" 1 gi(t K ) Pj(t K ) w K = 0 for i < j (3.26) 

K= 0 
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Assume that Pj (t) has complex zeroes. Since they appear in complex conjugate 
pairs there must be at least two of them x and x say. Then let g t (t) = Pj (t)/{t - x) 
(t - x) which is a polynomial of degree j - 2. It then follows from equation (3.26) 


that 


N ~ 1 

J2 >/(',< - - *> = 0 

K = 0 

but this is clearly impossible since all the quantities in the sum are positive. 
(Remember that (t K - x) (t K - x) = t i 2 + |x | 2 . A similar argument leads to the 
conclusion that all zeroes are distinct [if two zeroes are coincident and equal 
to x, say we choose g^t) = P (t) / (t - x )2 ]. 

If a zero is less than t Q or greater than t l and equals x say, then we 
choose g. (t) = P (t)/(t - x) and apply the same argument to show that equation (3.26) 
cannot possibly be satisfied. 

We thus conclude that all roots are real, distinct and lie in the range to 


3.2.4 A Recurrence Relation Between the Polynomials P t (t) 

(The derivation given here is based on that given by Hildebrand [2] .) 
According to equation (3.4) we have 


Hence, 


n + 1 


P n + l(t) = Z] Cn+1> 1 11 and P n (t) = 7"! Cn.jt 1 

i = 0 


i = 0 


Cn.nPn + iCt) ^n+1 , n+1 tP„Ct) 


n 

z: 

i - 1 


^n.n^n + 1, 


- c 


n + 1 , n + 1 




t 1 


+ ^n,n^n+l,0 


The right-hand side, whose leading term is (C C , . - C . , . , C , ) t n , 

° 9 ' n,n n+l,n n+1, n+1 n,n~l 7 * 

is a polynomial of degree n. Hence it may be written as a linear combination 
of polynomials P ; (t) of order n and less. We may, therefore write 


n - 1 


c [c p + 1 (t)-c +1 + 1 tp(t)] =(c c . . -c +1 .,c .iprtj+y -1 h.p.(t') 

n,n n,n n + 1 v ' n+1, n+1 n v ' v n , n n + 1 , n n+1, n+1 n,n~l 7 n'' 7 / i i V / 

i = 0 

(3.27) 
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where the coefficients h. are to be determined. Since by equation (3.20) 


n - 1 

J2 ftw = ° 

K= 0 

for s = 0, 1, ... n - 1 we deduce from the above that h Q =h 1 = ... = h n _ 2 =0 
[we do this by successively performing the summation implied by equation (3.20) 
for s = 0, 1, . . . , n - 2] . Performing the summation with s = n - 1, we obtain 


N- 1 


N- 1 


^n.n^n + l, n+ 1 ^ ^n~ 1 Sc 


). 


K = 0 


K = 0 


whence by equation (3.21) h n _ t = - (C n+1 n+1 ) 2 . Equation (3.27) therefore reduces 
to 


(c y 

v n,n / 


(3.28) 


Equation (3.28) holds true for all values of n, provided that we define P_ x (t) 
to be identically equal to zero. Note that the equation becomes meaningless for 
n > N since in that case all terms will vanish. 


3.3 The Existence of an Equivalent Set of Points 

We shall show that there exists a set of v points (where v <, N) T 0 • T 1 * • • • T v - l 
and associated with these points a set of non-zero positive numbers v 0 , v x , . . . v v _ x 
such that 


V - 1 


N" 1 




‘i W K 


for i = 0, 1, ■ • • v - 1 


fJ- = o 


K= 0 


(3.29) 


If equation (3.29) were true then it would follow from equations (3.1) and (3.2) 
that we could construct our polynomials P. (t) (for i = 0, 1, . . . v) over the set 
( T o * r i» • • • r v ) with associated set of weights v 0 » v x , . . . v v . But then it 
would follow by analogy with equation (3.17) that 


p v( r /x) - 0 for M = o, 1. • •• V - 1 


(3.30) 
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In other words (r 0 , r 1 , . . .r v _ x ) are the zeroes of the polynomial P u (t). 

In Section 3.2.3 we established that the zeroes of a polynomial P. (t) are all real 
and distinct and lie in the range t Q < t ^ t N _ x . If we denote the smallest zero 
by r Q> the next one by Tj, etc. it follows that, 




<r 1 <r 2 


< T 


v-1 


<, t 


N“ 1 


(3.31) 


Next we are going to show that if the r’s are the zeroes of P v (t) then only 
v of the 2 v equations (3.29) are independent. In other words we shall show that if 


V - 1 V - 1 

J~\ T l\ = y] for i = o, 1 ,... V - 1 ( 3 * 31 ) 

M=0 K=0 

then the remainder of equations (3.29) will follow. The proof is by induction. 

Let us assume that 


N- 1 




t 1 w 
K K 


for i = 0, 1, . . - s - 1 


= 0 


K=0 


(3.32) 


where v <> s < 2v , The righthand side of equation (3.32) by definition (3.2) equals 
H t . The last v of the above equations may then be written as 


Multiplying 
deduce that 


V- 1 




v r" 1 = H. , where i = s - v, 

j + m 


M M M 


M = 0 


both sides of the above equation by C 


v , m 


and m = 0, 1, . . . v - 1( 3 * 33 ) 
and summing over m we 


V - 1 


V-1 


V- 1 


/ riy y 1 C r m = y c H. 

/ , M H ( v,m fj. / | v,m j 


fJL~ 0 


m ~ 0 m = 0 


+m 


Since by equation (3.4) 


(3.34) 


P (r ) = 

V V LL' 


V 


T 

m fjL 


m = 0 


and since r is a zero of P (t) it follows that 


V - 1 

L 

m - 0 


C T m — — C T V 

V ,m fJL V , V fJL 


(3.35) 
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Since C v m is the cofactor of t m , the (m + 1) th element in the (v + l)th row 
in the determinant defining P v (t), and since H i+m is the (m + l)th element in the 
(j + l)th row it follows from the theory of determinants that (provided that j < v ) 


m=0 


and hence that 


V - 1 


H c - H i 


_ p IT 

+m j+v 


m= 0 


(3.36) 


It follows from equations (3.34), (3.35) and (3.36) that 


V - 1 

L 

n = 0 


T 1 V T 
M K M 


v - H. , for ] < v 

J + V J 


(3.37) 


We have thus shown that equation (3.33) is valid for m = v . It follows by 
induction that equations (3.29) may be deduced from equations (3.30) and (3.31). 
By an argument identical to the one used at the end of para. 3.2.1 above it follows 
that the v x v matrix of coefficients t 1 has a non-zero determinant so that the 

fjL 

matrix is non-singular. Hence we can use the set of v linear equations (3.31) 
to solve for v 0 ,v 1 ,...,v i/ _ 1 . We have thus demonstrated the existence of 
a set of points (r A , r r \ and the j associated numbers v , v , . . . v ) 

'01* v- r * 01 v~\ 

which satisfy equation (3.29). 

Equations (3.24) may be written in the equivalent form 


V- 1 


N- 1 


q i ( V = 2Z qi(t K } W K for i = 0. 2v - 1 


\J- = 0 


K= 0 


(3.38) 


where q A (t) is any ith degree polynomial in t. In particular, let us choose q . (t) 
P m (t) P n (t) where m + n = i. Then 


v - 1 


N“ 1 


/ P (t ) P (r ) v = V” 1 P ft ) P (t ) W f 

/_ , n v y.> n / nA K ' K 


or m, n = 0, 1, . . . v - 1 


M = 0 


K = 0 
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But by equation (3.25) the righthand side of the above equation may be written 
as S it follows that 

mn 


v- 1 

J"! P J T ^ P n( r M ) V M = S n,n f ° r “>• 11 = 0, 1, . . . V - 1. 
a = 0 

In matrix form this may be expressed as 


(3.39) 


G T VG = I, (3.40) 

where I is the v x v identity matrix, G is defined by equation (1.20) and V by 
equation (1.23). 

Inverting both sides of equation (3.40) we obtain 

G" 1 V- 1 (G T )" 1 = I. 


Hence 


GG T = V' 1 (3.41) 

which is the same as equation (2.8). Thus we have derived all results which we 
set out to obtain. 


3.4 Some Additional Properties of the Polynomial P £ (t) 

In non-matrix notation equations (3.41) may be written as 

V- 1 

y -1 p i CO P.(r ) = v _1 8 (3.42) 

/ i v m / i v n / m mn 

i = 0 

if m = n equation (3.42) becomes 


V - 1 

(3 - 43) 

i = 0 

One consequence of equation (3.43) is that the set of polynomials P Q (t), P (t), . . . 
P v _ It) cannot have a common zero. Since we may deduce from equation (3.28) 
that if two polynomials, whose degrees differ by one, have the same zero, then 
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all polynomials have the same zero. But since this is impossible we conclude 
that two polynomials, whose degree differ by one cannot have the same zero. 

In general, we can now say that the zeroes of the polynomial P n (t) cannot 
equal the zeroes of P n+ , (t) and are bounded above and below by the largest 
and smallest zero of P n+1 (t). 


3.5 Relationship to Gaussian Quadrature 

If the set of points t 0 , t lf . . . t N _ x is infinite the sum on the righthand side 
of equation (3.38) gets replaced by an integral. We thus obtain 

i 1 C b 

= I i ( t ) w ( t ) dt . for i = 0, 1,... 2v - 1, ( 3 * 44 ) 

where g t is any ith degree polynomial in t. Equation (3.44) tells us that we may 
approximate the above integral by a finite sum. 


3.6 Uniqueness of the Orthogonal Polynomials 

We shall show that the polynomials P. (t) are the only ith degree polynomial 
having the property that. 


N- 1 


L 

v = n 




S. . for 


j = 0, 1,... N - 1 


(3.45) 


Suppose 


N 


L 

K = 0 




8 . . 

1 ) 


(3.46) 


Since Qi(t) is an ith degree polynomial in t we may express it as a linear combi- 
nation of P Q (t), Pj (t), . . . Pj (t). Thus there exists a set of numbers h ) such 
that 


Q i (t) = h 0 P Q (t) + hjPjft) + ... h.P^t) 


(3.47) 
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It follows from equations (3.45) and (3.47) that 


N~ 1 

for i = o, l i (3 - 48) 

K = 0 

But equation (3.46) implies that for any polynomial R. (t), 

n- 1 

2] E i< V \ = 0 if i <i 

K - 0 

We hence deduce from equation (3.48) that h Q = h 1 = ...h i _ 1 =0, whence 
equation (3.47) becomes 

Qi(t) = h.PjCt) (3.49) 

Substituting the expression for Qi(t) from equation (3.49) on the lefthand side 
of equation (3.46) (with j = i) we obtain, 


N- 1 

h? [Pj (t K )] 2 w R = 1 (3.50) 

K= 0 

We hence conclude from equations (3.50) and (3.45) that 



h2 _ j (3.51) 

Thus Q j(t) = ± Pi (t) for i = 0, . . . N - 1. The statement that Pi (t) is 
unique is, therefore, not absolutely true since the sign is arbitrary. 


The fact that the orthogonal polynomials are unique implies that if the set of 
points t Q , t , . . . t N 1 are evenly distributed in the range (-1, 1) and if W = l/N, 
then our polynomials P n (t) will degenerate to normalized Legendre polynomials 
as N-co . 

4. The Effect of Using a "Fitting" Polynomial, which is One Degree Too Low 

In this section we shall investigate the effect of the components of the data 
vector being of the form. 
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(4.1) 



S. = a P (t. ) 

= 0, inaccordance with equation (3.30) 

Therefore, 

Z = Ga + e ( 4 ' 6 > 

Corresponding to equation (2.2) we then have 

e = G(F T WF) -1 F t W (M + e) ( 4 * 7 ) 

The (i + l)th component of the v -vector F T w M is given by, 

N~ 1 

<*"■«>,=£ F Ki»K«K 
K = 0 
N~ 1 

= / P. (t i w a P (t ) 

! 1 V K' K v v' K ' 

K = 0 

= a v S iv > by equation (3.25) ( 4 * 8 ) 
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Since by equation (2.7) F T W F = I, it follows from equation (4.7) and (4.8) 
that the (j + l)th component of e is given by 


V- 1 

e. = (GF T We). + Y G. a S. 

1 K / JK v iv 

i = 0 

The sum on the righthand side of the above equation vanishes since i is always 
less than v. We therefore conclude that 

e = GF T We, < 4 * 9 ) 

which is the same as equation (2.2) [since F T WF = I by equation (2.7)] . 

In other words "underfitting" (i.e. the effect of assuming that the given data 
Y can be described by a polynomial of degree v - 1, when in reality a polynomial 
of degree v is required) does not by itself introduce an error in the compressed 
data points. On the other hand, some information loss obviously does result. 


5.0 The Zeroes of P s (t) are Separated by the Zeroes of P gi (t) 

Proof: First we shall show that between each pair of points t s and t S+1 lies 
one and only one zero of P N _ x (t). We may deduce from equation (C.5) in 
Appendix 3 that the sign of P N _ i (t s ) is positive if the number of terms of the 
form (t s - t K ), where k > s, is even. In other words P (t s ) is positive if 
(N - 1) - (S + 1) + 1 is even, i.e. 

P N -i(t s ) is positive if (N - 1 - S) is even. 

It follows that P N _j (tj and P N _j(t S+J ) must be of opposite sign. Since P ] (t) 
is a polynomial and hence a continuous function we deduce that P (t) must 
have a zero between t , and t . It thus follows that P (t) must have a zero 

S+1 JSJ- 1 ' 7 

between t Q and t lf t } and t 2 , . . . , t N _ 2 and t N _j. Since the N - 1 zeroes of 
P N _ ! (t) are thus accounted for, it follows that P N _ 1 (t) has one and only one zero 
between any pair of points t 0 and t s+1 . Since the points t Q , tj, . . . t N+1 are 
the zeroes of P N (t) we have shown that the zeroes of P (t) are separated by 
the zeroes of P N _ L (t). In Section (3.3) we showed that the set of polynomials 
P 0 (t), P x (t), . . . P s _j (t) may be constructed on the zeroes of P s (t). Hence the 
result follows. 
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APPENDIX A 


The Determinant D Whose Elements d. . are Given by d. . = t l 

1 j J 1 j j 

is Equal to the Product of all Terms of the Form t A - tj , where i > j 
(If D is an N x n determinant then its elements are given by d . . 

where i, j = 0, 1, . . . N - 1) J 


Proof: By definition 


1 1 1 ... 1 


D = 



(Aol) 


t N-l t N~l «-N-l f N-l 

0 1 2 ••• Vl 


Since the determinant vanishes if any two columns are identical it will vanish 
if any of the t. T s are equal. Hence D must be of the form, 


D = KP 


(A .2) 


where 


p =TT (A - 3) 

i=l j=0 

and where K is some polynomial in the t. 's. From equation (A.l) we see that 
D is a polynomial of degree (1 + 2 + . . . N - 1) in the t. T s. But the degree of the 
polynomial P is the same, since each term in the product is of degree 1 and the 
number of terms is (1 + 2 + . . . + N - 1). It then follows from equation (4.2) 
that K must be of degree zero, i.e. K is a constant. 

To show that K = 1 we look at the coefficient of tj 1 1 1 1 . . . t J both in 
D and in P. It is easily seen that there is only one term of the form t t 2 t 3 . . . 
tj}l{ in equation (A.l). This is the product of the diagonal elements. The 3 
desired coefficient in D thus equals 1. 
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Since, 


vm- c:= IT Tk 

i=l j=0 

it follows from equation (A.3) that the desired coefficient in P also equals 1. 
Hence K must equal 1. Therefore D = P, i.e. 

D= IT (A-4) 

i=l j=0 
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APPENDIX B 


The Expression of the Polynomial P N (t) as a Product 


In accordance with equation (3.1), P (t) is given by 







H 1 H 2 • ^ 
^2 ^3 ‘ ‘ ^N+l 


^N-l ^N ^N+l ' ‘ ^ 2 N -1 

1 t t 2 ... t N 


where 

N-l 

"i =£ 

K=0 

Let us define the N x N matrices A, B and W by, 


1 

1 

1 

.. 1 

1 


1 

1 

1 

.. 1 

0 

*0 

s 

fc 2 ■ 

‘ ‘ t N~ 1 

t 

B = 


tl 

t 2 • 

- t N -l 

0 

f-N-1 

0 

f-N-1 

L 1 

fN-1 

2 

t-N-1 
' N-l 

t N~l 


t N~l 

4-N-l 

1 

tN-l 

2 

4-N-l 

■ N-l 

0 

TN 
_ 0 

tN 

.. 

. t N 
N-l 

t N 


0 

0 

0 . 

• 0 

1 


and 


w Q 0 0 ... 0 0 

0 Wj 0 ... 0 0 

0 0 0... w N _ x o 

_0 0 0 ... 0 !_ 
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(B.l) 


(B.2) 


(B.3) 


(B.4) 



It is easily verified that 


P N (t) = | b t w a| 

= |b| | W I |A| (B.5) 

since the determinant of the product of square matrices equals the product of 
their determinants. 

Since W is a diagonal matrix it is easily seen that 


N-l 


w 


TT». 


i = 0 


It follows from equation (B.3) and the result of Appendix 1 that 


' N" 1 i-1 


i=l j=0 


<*1 "V 


N-l 


K=0 


(* - O 


and 


(B.6) 


(B.7) 


N-l i-1 


Ibi = TT TT (t- - 1. 

' v ! J 


) 


i=l j=0 

Hence we deduce from equations (B.5), (B.6), (B.7) and (B.8) that, 


(B.8) 


r N-l i-1 ‘ N-l 

p x (t) HIT TT <*» - TT v*-*, 


i=l j=0 


K=0 


(B.9) 
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APPENDIX C 


The Expression of P^_ 1 (tj as a Product (for s = 0,l,...N-l) 


By definition (3.1), 



«o 

Hx 

h 2 .. 

•• Vi 




h 2 

H 3 • 

■■ Hn 


P N- l^i ) 

Hn-2 

Hn-i 


IJ 

* 2N-3 

(C.l) 


1 

t. 

t 2 . 

s 

t N“l 

s 


where 

Hi 

N~ 1 

■L 

K=0 

t* W 
K K 


(C.2) 

If we now subtract (w t‘) times the last row from the (i + l)th row the value of 
the determinant will remain unchanged. We then obtain. 


H o 

»; 

H' • 

•• K-i 


P N“ l^j ^ 

K 

H 2 

H' .. 

• % 

(C.3) 


K-2 


% • 

■ ■ H 2N-3 



1 


t 2 

s 

t N ” 1 

s 


where 

H' 

1 

N" 1 

-L' 

K=0 

K^s 

; * w 
K K 


(C.4) 
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Comparing equations (C.3), (C.4) with equations (B.l), (B.2) and (B.4) in 
Appendix B we deduce that 






(C.5) 
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